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(57) ABSTRACT 

A system and method for determining which processor is to 
be the master processor in a symmetric multi-processor 
(SMP) environment. The determination is made by boot- 
level code, i.e. the software program which executes first in 
a processor after it is brought on-line. Each processor in the 
SMP system is brought on-line independently of the other 
processors in the system, and each processor in the system 
can uniquely identify itself. As a processor comes on-line, it 
checks to see if a master processor has already been desig- 
nated. If not, the processor checks to see if another 
processor, with a higher priority identifier, has identified 
itself as a working processor. If so, the processor commits to 
being a slave processor. If not, the processor indicates that 
it is available to be the master processor. A further check is 
made to ensure that only one processor has indicated that it 
is available to become the master processor. The likelihood 
that a particular processor will become the master processor 
may be increased or decreased by changing the processor's 
unique identifier. Thus, the system and method is control- 
lable and changeable, without the need for any special or 
dedicated hardware. The probability that the system will 
always successfully select a master processor is increased, as 
is the probability that the system will initialize successfully 
even if one or more of the processors malfunctions. 

12 Claims, 3 Drawing Sheets 



| Wnti i trio [grayed 
| Readpthwyafl 




Contftttobt i 






stara processor 




itema&wl 



| Sim a! demits ctfaf^- 100 



SditomaScdtozsro 



X 



£1 



Cumfl to bfl resstsi 
processor ml wrSe a 



05/21/2004, EAST Version: 1.4.1 



U.S. Patent Jan. 23, 2001 Sheet 1 of 3 US 6,178,445 Bl 




05/21/2004, EAST Version: 1.4.1 



U.S. Patent Jan. 23,2001 Sheet 2 of 3 US 6,178,445 Bl 



50 



L 



CellO Celll Cell2 Cell3 Cell4 Cell5 Cell6 Cell7 Cell8 



Primary Cells 



52 



Alternate Cells 

54 



56 



FIG. 2 



Physical 


Logical 


Processor 


Identifier 


0 


0 


1 


1 


2 


2 


3 


3 



Physical 


Logical 
Identifier 


Processor 


0 


2 


1 


3 


2 


0 


3 


1 



FIG. 4A 



FIG. 4B 



05/21/2004, EAST Version: 1.4.1 



U.S. Patent Jan. 23, 2001 Sheet 3 of 3 US 6,178,445 Bl 



Write a one into primary cell 

1 



Read primary cell 



80 



82 



FIG. 3 



Read the global "master chosen" flag 



84 



21 



Commit to be a 
slave processor 




90 



92 



Sum primary cells from 0 to own primary cell 





YES 



NO 



94 




Sum 
>1? 



Commit to be a 
slave processor 



r 



96 



Write a one into its 
alternate cell 



T 



Read alternate cell 
i 



Sum all alternate cells 

V 



98 



■100 



104 




Set alternate cell to zero 



106 



Read alternate cell 



108 



Commit to be master 
processor and write a 
one into global "master 
chosen" flag cell 



05/21/2004, EAST Version: 1.4.1 



US 6,1 

1 

SYSTEM AND METHOD FOR 
DETERMINING WHICH PROCESSOR IS 
THE MASTER PROCESSOR IN A 
SYMMETRIC MULTI PROCESSOR 
ENVIRONMENT 

FIELD OF THE INVENTION 

The present invention relates to information handling 
systems, and, more particularly, to a system and method for 
determining which processor is to be the master processor in 
a symmetric multi-processor environment 

BACKGROUND OF THE INVENTION 

In a symmetric multi-processor (SMP) system, a number 
of processors are connected together by means of a system 
bus. The system bus may be used by only one processor at 
a time. Other devices, such as storage and I/O devices, are 
also connected to the system bus. These devices, and other 
system resources, such as clocks, timers, direct memory 
access, and memory refresh controls, are shared by the 
processors. 

During system reset, sometimes referred to as power-on- 
reset (POR), one of the processors is designated as the 
master processor. The master processor typically takes con- 
trol of the system bus, and then initializes each of the other 
processors. The master processor also initializes the other 
devices in the system, and coordinates the starting of the 
entire system. For example, the master processor may test 
various parts of the system, such as memory areas. After 
system initialization, the master processor may designate a 
different processor as the master processor, or may remain 
the master processor. During operation of the system, the 
master processor may assign work to the other processors. 

When a system is powered on, one of the processors is 
designated as the master processor. Prior art methods for 
determining which processor is to be the master processor 
are typically hardware-based methods, which can not be 
changed or controlled without making a hardware change. 
For example, in "Method to Determine a Lead Processor in 
a Symmetric Multi-Processor System," IBM Technical Dis- 
closure Bulletin, Vol. 39, No. 05, May 1996, pp. 53-54, each 
processor has an equal chance to become the master. A 
system bus arbiter determines which processor is granted 
access to the bus in order to read a first access register. Thus, 
the master selection process can not be controlled or 
changed without changing the hardware. 

Another prior art method, "Start- Up Master Processor 
Selection Method For Multi-Processor Systems/' IBM 
Technical Disclosure Bulletin, Vol. 33, No. 4, September 
1990, pp. 375-376, discloses a method based on start-up 
circuitry in a system resource unit. The start-up circuitry 
polls each processor sequentially in order to find the first 
processor capable of becoming the master. The master 
processor then informs the other processor units that they are 
slave processors. The master processor selection can only be 
controlled by changing the hardware wiring, as the master 
processor is always the first processor to be polled. Further, 
this method may not always result in a successful system 
initialization. If one of the processing units malfunctions, 
and fails to stop driving the Ready line, the start-up circuitry 
will never start, and the system will not come up. 

Consequently, there is a need for a system and method for 
determining which processor is to be the master processor in 
an SMP environment. It would be desirable for the system 
and method to be controllable and changeable, without the 
need for any special or dedicated hardware. It would also be 
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desirable to increase the probability that the system will 
always successfully select a master processor, and will 
initialize successfully even if one or more of the processors 
malfunctions. 

5 

SUMMARY OF THE INVENTION 

Accordingly, the present invention is directed to a system 
and method for determining which processor is to be the 
master processor in a symmetric multi-processor (SMP) 

30 environment. The determination is made by boot-level code, 
i.e. the software program which executes first in a processor 
after it is brought on-line. Each processor in the SMP system 
is brought on-line independently of the other processors in 
the system, and each processor in the system can uniquely 

15 identify itself. As each processor comes on-line, it executes 
a series of steps. All processors in the system may be 
executing the steps concurrently. 
As a processor comes on-line, it checks to see if a master 

M processor has already been designated. If not, the processor 
checks to sec if another processor, with a higher priority 
identifier, has identified itself as a working processor. If so, 
the processor commits to being a slave processor. If not, the 
processor indicates that it is available to be the master 

^ processor. A further check is made to ensure that only one 
processor has indicated that it is available to become the 
master processor. If the processor determines that more than 
one processor has indicated its availability to become the 
master processor, the processor de-commits, and returns to 

30 the step of checking to see if a master processor has already 
been designated. 

The system and method of the present invention is 
controllable, and may be changed without changing the 
system hardware. Because each processor has a priority 

35 based upon its unique identifier, changes may be made by 
mapping each physical processor to a different logical 
identifier. The mapping may be stored in a non-volatile 
memory area. 

An advantage of the present invention is that the likeli- 
40 hood that a particular processor will become the master 
processor may be increased or decreased by changing the 
processor's unique identifier. Thus, the system and method 
is controllable and changeable, without the need for any 
special or dedicated hardware. Another advantage of the 
45 present invention is that it increases the probability that the 
system will always successfully select a master processor, 
and will initialize successfully even if one or more of the 
processors malfunctions. 

50 BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other features and advantages of the 
present invention will become more apparent from the 
detailed description of the best mode for carrying out the 
invention as rendered below. In the description to follow, 

55 reference will be made to the accompanying drawings, 
where like reference numerals are used to identify like parts 
in the various views and in which: 

FIG. 1 is a block diagram of an information handling 

60 system capable of executing the method of the present 
invention; 

FIG. 2 is a block diagram depicting memory cells in a 
memory area of the information handling system of FIG. 1; 

FIG. 3 is a flow chart illustrating a method for executing 
65 the present invention; and 

FIGS. 4A and 4B are block diagrams of a mapping table 
used to map physical processors to logical identifiers. 
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DETAILED DESCRIPTION OF A PREFERRED CellO, celll, ceD2, and celD are referred to as primary cells 

EMBODIMENT OF THE INVENTION 52. Cell4, cellS, cell6, and ceU7 are referred to as alternate 

The invention may be implemented on a variety of celk 54. Cell8 is a global "master chosen" flag cell 56. When 

hardware platforms, including personal computers, svslem * powered on, a memory controller -preferably 

workstations, embedded systems, mini-computers, and 5 initializes each of memory cells 50 to zero. Each processor 

mainframe computers. Referring now to FIG. 1, a typical ™ system is associated with one primary cell 52 and one 

configuration of an information handling system that may be alternate cell 54. For example, processor 0 is associated with 

used to practice the novel method of the present invention P rimar y ^ 31x6 alternate cell4. Processor 1 is associated 

will be described. The computer system of FIG. 1 has at least Wlth pnmary celll and alternate cellS, and so on. 

one processor 10. Processor 10 is interconnected via system 10 As each processor is brought on-line, it executes the steps 

bus 12 to random access memory (RAM) 16, read only depicted in FIG. 3. Note that each processor begins execut- 

memory (ROM) 14, and input/output (I/O) adapter 18 for ing the steps as the processor is brought online. It is not 

connecting peripheral devices such as disk units 20 and tape necessary for one processor to complete execution of all the 

drives 40 to bus 12, user interface adapter 22 for connecting steps before another processor can start execution. Thus, as 

keyboard 24, mouse 26 having buttons 17a and 17b, speaker 5 the processors come on-line, they will be concurrently 

28, microphone 32, and/or other user interface devices such executing the steps depicted in FIG. 3. 

as a touch screen device 29 to bus 12, communication Referring now to FIG. 3, a method of implementing the 

adapter 34 for connecting the information handling system pre sent invention will now be described. When a processor 

to a data processing network, and display adapter 36 for comes on .ii ne> it writes a one into its primary cell (step 80). 

connecting bus 12 to display device 38. M For cxamp i Cj whcn processor 0 comes online, it writes a one 

Communication adapter 34 may link the system depicted m to cellO. The processor then reads back its own primary 

in FIG. 1 with hundreds or even thousands of similar cc n to guarantee that the write has completed (step 82). A 

systems, or other devices, such as remote printers, remote working processor is a processor which has successfully 

servers, or remote storage units. The system depicted in FIG. storec } a va i ue 0 f one j nto primary cell. 

1 may be linked to both local area networks (sometimes ^ The processor next reads global "master chosen" flag cell 

referred to as Intranets) and wide area networks, such as the 56 ( slep ^ jf tne value of flag ^ 56 ^ aot zer0 ( slep 

Internet. mc proccssor commits to being a slave processor (step 88), 

The present invention is a system and method for allow- anc i j s no longer involved in the master selection process. If, 

ing boot-level code to determine which processor is to be the however, flag cell 56 is equal to zero (i.e. the answer to the 

master processor in a symmetric multi-processor (SMP) 30 question in step 86 is "yes"), the processor continues on in 

environment. Boot-level code is the software, or program, the master selection process. The processor next sums 

which executes first in a processor after the processor is primary cells 52 from cellO up to and including its own 

brought on-line. Note that boot-level code is sometimes primary cell (step 90). For example, processor 2 would add 

referred to as firmware or as Power-On-Reset (POR) code. together the contents of cello, celll, and cell2. The processor 

Because the determination as to which processor is to be the 3S then determines if the sum is greater than one (step 92). If 

master processor is made by the boot-level code, it is easier me sum ^ grea ter than one, this indicates that some other 

to change the determination than in prior hardware-based working processor has previously written to its primary cell, 

methods. As will be described below, with reference to p or example, if processor 2 obtains a result of two from 

FIGS. 4A and 4B, the determination as to which processor summing cellO, celll, and cell2, then this indicates that 

is most likely to become the master processor may be 40 either processor 0 or processor 1 has previously written to its 

changed via software, and requires no specialized hardware primary cell. Thus, the processor will commit to be a slave 

or hardware changes. In addition, the present system and processor (step 94), and will no longer be involved in the 

method will work with or without cache and memory master selection process. Note that steps 90 through 94 in 

coherency hardware, so long as a memory controller with effect assign a higher priority to lower numbered processors, 

consistency support is used. 45 j n other words, if, during the summing of primary cells (step 

The present invention may be implemented in an SMP 90), a processor finds that a lower numbered processor has 
system which includes any number of processors. The come on-line and is a working processor (i.e. has success- 
processors are brought on-line independently of each other. fully written a one into its primary cell), the processor 
For example, they may be randomly released, or released at (which is a higher numbered processor) removes itself from 
staggered intervals. Each processor can uniquely identify 50 the master selection process. 

itself. As will be discussed below, with reference to FIGS. [f th e summing in step 90 results in a number which is not 

4A and 4B, a processor's identity may be changed through greater than one (i.e. the answer to the question in step 92 is 

the use of a mapping table. A number of contiguous memory "no"), then the processor continues on in the master selec- 

cells, stored in RAM 16, are preferably used to implement tion process. The processor writes a one into its alternate 

the method of the present invention. 55 memory cell (step 96). For example, in a four-processor 

Referring now to FIG. 2, an example configuration of system, processor 2 would write a one into cell6. The 
memory cells used to implement the present invention will processor then reads its own alternate memory cell to ensure 
be described. The number of memory cells used is (2n+l), that the write completed (step 98). The processor then sums 
where "n" is the number of processors in the SMP system. all the alternate cells (step 100). For example, in a four- 
For example, if there are three processors in the system, 60 processor system, processor 2 would add together the con- 
seven memory cells are used. Similarly, if there are four tents of cel!4, cell5, cell6, and cell7. The processor then 
processors in the system, nine memory cells are used. Note checks the result (step 102). If it is greater than one, the 
that a memory cell may be any defined memory area, such processor sets its alternate memory cell to zero (step 104), 
as a byte, half-word, word, etc., and the memory cells are reads its alternate memory cell to ensure that the write has 
preferably contiguous. 65 completed (step 106), and returns to step 84. If the sum is not 

The example depicted in FIG. 2 includes nine memory greater than one (i.e. the answer to the question in slep 102 

cells 50 for use in an SMP system with four processors. is "no"), then the processor commits to be the master 
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processor, and writes a one into global "master chosen" flag 
cell 56 (step 108). 

The present invention insures that one processor, and only 
one processor, will become the master processor in an SMP 
system. As discussed above, with reference to steps 90 
through 94 of FIG. 3, the present invention effectively 
assigns a higher priority to lower numbered processors. 
However, if the processor with the lowest number fails, 
another processor will become the master processor. Thus, 
the present invention increases the probability that the 
system will successfully select a master processor, and that 
the system will initialize successfully even if one or more of 
the processors malfunctions. 

The system and method of the present invention is 
controllable, and may be changed without changing the 
system hardware. Because each processor has a priority 
which is based on its unique identifier, changes may be made 
by mapping each physical processor to a different logical 
identifier. The mapping may be accomplished via a table, or 
other known data structure. The mapping is preferably 
stored in a non-volatile memory area, such as non-volatile 
RAM. 

For example, suppose that an initial mapping of a four 
processor system is as shown in FIG. 4A. As shown, 
physical processor 0 is identified as logical processor 0, 
physical processor 1 is identified as logical processor 1, 
physical processor 2 is identified as logical processor 2, and 
physical processor 3 is identified as logical processor 3. The 
method illustrated in FIG. 3 may use the logical identifiers 
for the processor numbers. In the example shown in FIG. 
4A, each physical processor is mapped to the same logical 
identifier. Thus, if processor 0 is a working processor, it will 
most likely become the master processor, as preference is 
given to lower numbered processors. 

If it is desired to have another processor be the master 
processor, the mapping table can be changed. An example is 
depicted in FIG. 4B. As shown in FIG. 4B, physical pro- 
cessor 0 is mapped to logical processor 2, physical processor 
1 is mapped to logical processor 3, physical processor 2 is 
mapped to logical processor 0, and physical processor 3 is 
mapped to logical processor 1. In this example, physical 
processor 2 is most likely to become the master processor, 
so long as it is a working processor, because it has the lowest 
numbered logical identifier. 

Exemplary pseudo-code for executing a method of the 
present invention is shown below. 
Assumptions 

1 . The SMP supports n processors. 

2. There are 2n+l memory cells, beginning with cell 0. 

3. Each processor has a unique identifier x, where 0<-x<- 
n-1. 

4. Processor x owns two of the memory cells, namely cell 
number x and x+n. 

5. Cells 0 through n-1 are primary cells. 

6. Cells n through 2n-l are alternate cells. 

7. Cell number 2n is the global "master chosen" flag. 

8. W(x,y, value) denotes a memory write by processor x to 
cell y with data "value." 

9. R(x,y) denotes a memory read by processor x from cell y. 

10. PRI_SUM(x)=R(x,0) for x=0, and R(x ( 0)+ . . . +R(x,x) 
for x>0. 

11. ALT_SUM(x)oR(x,n)+ . . . +R(x,2n-1). 

12. Each processor executes the steps below in the order 
given. However, one or more processors may be execut- 
ing the steps at the same time. 



Pseudo-code for each processor step 

1. W(x,x,l); R(x,x); 

2. If (0 != R(x,2n)) then goto "become_a_slave"; 

3. If (1 !- PRI_SUM(x)) then goto "become_a_slave"; 

4. W(x,n+x,l); R(x,n+x); 

5. If (1 ALT_SUM(x)) then 
W(x,2n,l); R(x,2n); 
Else 

W(x,n+x,0); R(x,n+x); goto step 2; 
Endif 

Although the invention has been described with a certain 
degree of particularity, it should be recognized that elements 
thereof may be altered by persons skilled in the art without 
departing from the spirit and scope of the invention. One of 
the preferred implementations of the invention is as sets of 
instructions resident in the random access memory 16 of one 
or more computer systems configured generally as described 
in FIG. 1. Until required by the computer system, the set of 
instructions may be stored in another computer readable 
memory, for example in a hard disk drive, or in a removable 
memory such as an optical disk for eventual use in a 
CD-ROM drive or a floppy disk for eventual use in a floppy 
disk drive. Further, the set of instructions can be stored in the 
memory of another computer and transmitted over a local 
area network or a wide area network, such as the Internet, 
when desired by the user. One skilled in the art would 
appreciate that the physical storage of the sets of instructions 
physically changes the medium upon which it is stored 
electrically, magnetically, or chemically so that the medium 
carries computer readable information. The invention is 
limited only by the following claims and their equivalents. 
What is claimed is: 

1. A method for determining a master processor in an 
information handling system containing a plurality of 
processors, comprising the steps of: 

assigning a logical identifier to each processor, wherein 
the logical identifier is unique for each processor, and 
wherein the logical identifier determines a priority for 
each processor, 
bringing each processor on-line; and 
deciding, by each processor, in response to being brought 
on-line, if it is the master processor, wherein said 
deciding step includes the steps of: 
indicating, by each processor, that it is functioning, 
wherein said indicating includes writing a first indi- 
cator into a first designated memory area, wherein 
each processor has a unique first designated memory 
area; 

determining if the master processor has already been 
selected, wherein said determining includes check- 
ing a global flag; 
if the master processor has not already been selected, 
determining if one or more higher priority processors 
is functioning, wherein said determining includes 
checking the first designated memory area of each 
higher priority processor; 
if no higher priority processor is functioning, then 
committing to be the master processor, wherein said 
60 committing includes writing a second indicator into 

a second designated memory area, wherein each 
processor has a unique second designated memory 
area; 

determining if one or more other processors has com- 
65 mitted to be the master processor, wherein said 

determining includes checking the second desig- 
nated memory area of each of the other processors; 
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if one or more other processors has committed to be the 
master processor, de -committing to be the master 
processor; and 

if no other processor has committed to be the master 
processor, indicating that the master processor has 5 
been determined, wherein said indicating includes 
setting the global flag. 

2. A method according to claim 1, wherein said assigning 
step comprises the step of mapping a physical identifier for 
each processor to a corresponding logical identifier. 10 

3. A method according to claim 2, wherein said mapping 
step further comprises the step of storing the physical 
identifier for each processor, along with the corresponding 
logical identifier, in a table. 

4. A method according to claim 3, wherein the table 15 
resides in non-volatile random access memory. 

5. A method according to claim 1, wherein said step of 
checking the first designated memory area of each higher 
priority processor further comprises the steps of: 

if one or more higher priority processors has written the 20 
first indicator into its first designated memory area, 
then determining that a higher priority processor is 
functioning; and 

if no higher priority processor has written the first indi- 
cator into its first designated memory area, then deter- 25 
mining that no higher priority processor, is functioning. 

6. A method according to claim 1, wherein said step of 
checking the second designated memory area of each of the 
other processors further comprises the step of: ^ 

if one or more other processors has written the second 
indicator into its second designated memory area, then 
determining that one or more other processors has 
committed to be the master processor. 

7. A method according to claim 1, wherein said step of 35 
de-committing comprises the step of removing the second 
indicator from the second designated memory area. 

8. An information handling system, comprising: 
a plurality of processors; 

means for assigning a logical identifier to each processor, 40 
wherein the logical identifier is unique for each 
processor, and wherein the logical identifier determines 
a priority for each processor; 
means for bringing each processor on-line; and 
means for deciding, by each processor, in response to 45 
being brought on-line, if it is the master processor, 
wherein said means for deciding includes: 
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means for indicating, by each processor, that it is 
functioning, wherein said means for indicating 
includes means for writing a first indicator into a first 
designated memory area, wherein each processor has 
a unique first designated memory area; 

means for determining if the master processor has 
already been selected, wherein said means for deter- 
mining includes means for checking a global flag; 

means for determining if one or more higher priority 
processors is functioning, wherein said means for 
determining includes means for checking the first 
designated memory area of each higher priority 
processor; 

means for committing to be the master processor, 
wherein said means for committing includes means 
for writing a second indicator into a second desig- 
nated memory area, wherein each processor has a 
unique second designated memory area; 

means for determining if one or more other processors 
has committed to be the master processor, wherein 
said means for determining includes checking the 
second designated memory area of each of the other 
processors; 

means for de-committing to be the master processor; 
and 

means for indicating that the master processor has been 
determined, wherein said means for indicating 
includes means for setting the global flag. 

9. An information handling system according to claim 8, 
wherein said means for assigning comprises means for 
mapping a physical identifier for each processor to a corre- 
sponding logical identifier. 

10. An information handling system according to claim 9, 
wherein said means for mapping further comprises a table 
for storing the physical identifier for each processor, along 
with the corresponding logical identifier. 

U. An information handling system according to claim 
10, wherein said table resides in non-volatile random access 
memory. 

12. An information handling system according to claim 8, 
wherein said means for de-committing comprises means for 
removing the second indicator from the second designated 
memory area. 
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